Genomic Splicing Sites Prediction Algorithm Based on Nucleotide Sequence Pattern
نویسندگان
چکیده
Splicing sites prediction plays a major role in genomic research of biomedical science. If splicing sites of genomic sequence can be correctly predicted, numerous crucial problems in biomedical area may be resolved naturally. Some typical examples are finding pathogenic factors of serious diseases and developing new gene therapy and so on. The major difficulty in developing the method of predicting the splicing sites lies in the diversity of species, which leads to substantially different sequence patterns of splicing sites among various species. For instance, the existing splicing sites prediction algorithms cannot effectively predict the splicing sites of influenza virus. For this reason, we propose a new genomic splicing sites prediction algorithm in this study, which is based on a data mining technology and many biomedical findings, to discover the candidate patterns of splicing sites in the gene sequences, to accurately predict splicing sites in the gene sequences, and to detect many new candidate splicing sites.
منابع مشابه
Hidden Markov Model for Splicing Junction Sites Identification in DNA Sequences
Identification of coding sequence from genomic DNA sequence is the major step in pursuit of gene identification. In the eukaryotic organism, gene structure consists of promoter, intron, start codon, exons and stop codon, etc. and to identify it, accurate labeling of the mentioned segments is necessary. Splice site is the ‘separation’ between exons and introns, the predicted accuracy of which is...
متن کاملAberrant 3′ splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization
The frequency distribution of mutation-induced aberrant 3' splice sites (3'ss) in exons and introns is more complex than for 5' splice sites, largely owing to sequence constraints upstream of intron/exon boundaries. As a result, prediction of their localization remains a challenging task. Here, nucleotide sequences of previously reported 218 aberrant 3'ss activated by disease-causing mutations ...
متن کاملSFmap: a web server for motif analysis and prediction of splicing factor binding sites
Alternative splicing (AS) is a post-transcriptional process considered to be responsible for the huge diversity of proteins in higher eukaryotes. AS events are regulated by different splicing factors (SFs) that bind to sequence elements on the RNA. SFmap is a web server for predicting putative SF binding sites in genomic data (http://sfmap.technion.ac.il). SFmap implements the COS(WR) algorithm...
متن کاملSRAMP: prediction of mammalian N6-methyladenosine (m6A) sites based on sequence-derived features
N(6)-methyladenosine (m(6)A) is a prevalent RNA methylation modification involved in the regulation of degradation, subcellular localization, splicing and local conformation changes of RNA transcripts. High-throughput experiments have demonstrated that only a small fraction of the m(6)A consensus motifs in mammalian transcriptomes are modified. Therefore, accurate identification of RNA m(6)A si...
متن کاملIdentifying N6-methyladenosine sites using multi-interval nucleotide pair position specificity and support vector machine
N6-methyladenosine (m6A) refers to methylation of the adenosine nucleotide acid at the nitrogen-6 position. It plays an important role in a series of biological processes, such as splicing events, mRNA exporting, nascent mRNA synthesis, nuclear translocation and translation process. Numerous experiments have been done to successfully characterize m6A sites within sequences since high-resolution...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005